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Abstract 

The analysis of observed conditional distributions of both lagged and simul- 
taneous intraday price increments of a basket of stocks reveals phenomena 
of dependence - induced volatility smile and kurtosis reduction. A model 
based on multivariate t-Student distribution shows that the observed effects 
are caused by collective non-gaussian dependence properties of financial time 
series. 



Corresponding author. E-mail leonidov@lpi.ru 



1 



1 Introduction 



One of the fundamental problems of quantitative finance is to develop a 
description of collective dynamical properties of market prices of an ensemble 
of financial instruments. 

Let us stress that a problem of working out an economic description of 
the properties of market prices is not completely solved even at the level of 
individual securities. A simple and very popular dynamical picture allowing 
transparent analytical treatment, that of a random walk, assumes a) normal 
distribution for the price increments and b) independence of price increments 
corresponding to different time intervals. Starting from the studies of Man- 
delbrot in the 60 'th [T| through more recent analysis (21 H3 El EJ E| there 
accumulated a large body of evidence that real price dynamics for individual 
securities reveals substantial deviations from both assumptions. 

Obviously we discover a much higher complexity when moving from a 
single financial security to a basket of securities. At this level we expect to 
deal with such novel effects as a) specific non-gaussian properties of the mul- 
tivariate distribution of price increments [UE]; b) temporal autocorrelations 
in price changes of single securities [31 El mixed with simultaneous cross- 
correlations between price increments of different basket ingredients [31 [7] . In 
fact, collective price dynamics is characterized by pronounced non-gaussian 
properties and a complicated web of interdependencies. 

In this study we apply a conditional distribution approach to scrutinize 
the dependence structure within an ensemble of financial instruments and the 
related nongaussian effects. Analysis of the value of "response" conditioned 
on the "input" having a certain magnitude enables to explicitly quantify the 
dependencies in the market data. Generically, dealing with a set of securities 
and following its temporal evolution, we can identify two types of conditional 
distributions which are of interest to us: a) distribution of a future price in- 
crement given that past price increments of all securities lie in a certain 
range; b) distribution of a price increment given that all other price incre- 
ments in the same time interval lie in a certain range. A simple example of 
the phenomenon described by the former distribution is the lagged autocor- 
relation, of the latter - the simultaneous cross-sectional correlation. Let us 
stress that separating time-lagged dependencies ("horizontal" for further ref- 
erence) from simultaneously existing ones ("vertical" for further reference) 
is a simplification of the generic picture which allows, however, to discuss 
various types of dependencies in a simple setting. A generic dependence 
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pattern is a "product" of both: past evolution of a subset of securities may 
influence future evolution of another subset. An importance of these generic 
"non-diagonal" contributions was studied, in the context of profitability of a 
simple contrarian strategy, in [S|. Let us also mention the recent studies of 
lagged conditional distributions of daily returns 01101, i n the latter reference 
- in relation to a particular stochastic volatility model. 

Analyzing, in terms of conditional distributions, the market data on in- 
traday price increments of a large set of liquid stocks traded in NYSE and 
NASDAQ we have found pronounced specific effects characterizing the con- 
ditional dynamics of price increments for both lagged and simultaneous types 
of dependence. Most spectacular is a relationship between the volatility of 
the "response" increment and the magnitude of the "input" one which can 
in simple terms be described as a dependence-induced volatility smile ("D"- 
smile). Another striking feature seen in the data is a dramatic reduction of 
the kurtosis of the conditional distribution of the "response" increments. 

To give a quantitative interpretation of these results we have developed 
a model description of the corresponding conditional distributions based on 
a multivariate non-gaussian t-Student distribution depending on both past 
and future price increments. Let us note that a multivariate t-Student distri- 
bution is a popular choice for analyzing the simultaneous [I] and lagged [TT] 
correlations in financial dynamics. The non-gaussian nature of the model 
turned out to be a key element enabling to explain the dependence struc- 
tures observed in the market data. In particular, conditional volatility smile 
and decrease of kurtosis take place even in complete absence of linear correla- 
tions. The above-described effects completely disappear, however, if one uses 
a multivariate gaussian distribution depending on the corresponding matrix 
of covariances (correlations) instead of the fat-tailed multivariate t-Student 
distribution. 

2 Observed features 

The object of our study is a dynamical evolution of a group of N = 100 most 
liquid stocks from S&P 500 2 within a two-year time period from January 
1, 2003 through December 31, 2004, characterized by the price increments 
5p(t) in the time interval of length r. In our analysis we use two intervals 
of length t = 6 min and r = 60 min. For an interval [t, t + r] we thus 

2 A list of stocks is given in the Appendix 
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have a configuration of N price increments {8p>(t) = p>(t + t) — p>(t)}, 
j = 1 • • • N, evolving in time. Most interesting are, of course, the features of 
this evolution distinguishing it from that of a group of independent objects. 
Such cohesion can be of both simultaneous (interrelations between the values 
of price increments of different stocks in the same time interval) and lagged 
(interrelations between the price increments of the same or different stocks 
in different time intervals) nature. 

Below we shall concentrate on the two simplest types of dependencies: 

1. Interrelation between the price increments in consecutive time intervals 
for the same stock ("horizontal" case) 

2. Interrelations between the price increments of different stocks in the 
same time interval ("vertical" case) 

Let us start with "horizontal" case and consider all pairs { 5p>(t), 5p>(t + 
t)} of stock price increments in two consecutive time intervals for some given 
j-th stock. Our goal is to describe probabilistic properties of the set of incre- 
ments at time t + r conditioned on the sign and magnitude of the increments 
at preceding time t. These properties are characterized by the corresponding 
conditional distribution constructed as follows: 

• First, we normalize the price increments Sp>(t) in the first interval of the 
pair by their unconditional standard deviation of ot , 5p>{t) — > af(t) = 

¥(t)l°L 

• Second, we divide the set of thus normalized increments into subinter- 
vals Aj having the fixed length 0.5. The total interval we consider is 
A = [—3.25, 3.25]. The subinterval Ai thus corresponds (for j-th stock) 
to x j E [-3.25,-2.75], etc. 

• For a pair with x J belonging to some fixed subinterval Aj we study 
the conditional distribution VA,(y^) of the normalized price increments 
yi = 8jP (t + t) j a{ ot in the second interval of the pair 

V Ai (y^=V(y j \x j e A t ) (1) 

The distribution (JIJ is then a "horizontal" coarse-grained conditional distri- 
bution 3 . 

3 Coarse graining refers to conditioned variable x belonging to some fixed interval Aj: 
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The basic properties of the conditional distribution VaXv) are conve- 
niently summarized by the values of its lowest moments - mean fi con d, stan- 
dard deviation cr con d, anomalous kurtosis K con d, etc. . In this paper we shall 
study the correspondingly normalized conditional mean, conditional standard 
deviation and conditional anomalous kurtosis. The above-described normal- 
ization allows to consider all stocks simultaneously. The normalized mean 
A*cond/ cr tot) standard deviation cr cond /cr tot and anomalous kurtosis K C ond/ K tot, 
where K to t is an unconditional anomalous kurtosis of the increments' distri- 
bution, of the "horizontal" coarse-grained conditional distribution (JTJ) (i.e. 
that characterizing the set of all adjacent 6-min. intervals for each stock) are 
plotted as a function of the rescaled initial push 5p/a tot in Fig. [T]. 

Let us now turn to the analysis of the "vertical" interrelations between 
simultaneous price increments of different stocks The corresponding coarse- 
grained conditional distribution is constructed in complete analogy with the 
above- described "horizontal" case: 

V^{v j ) = V(yi\x k e A,), (2) 

where the conditioned variable x k = 5p k (t)/a k ot refers to the k-th stock, and 
the response variable y J = 5p^{t)/a{ ot - to the j-th one. 

In Fig. El we show the normalized conditional mean, standard deviation 
and kurtosis for 6-min. intervals for the "vertical" case. 

In Fig. El we plot the medians of the scatterplots for the normalized con- 
ditional mean, standard deviation and kurtosis for 6-min. and 60-min. in- 
tervals, combining the "horizontal" and "vertical" quantities. 

The analysis of Figs. ^ 121 and 01 leads to the following conclusions: 

• The resulting plots for conditional mean /i con d m the "horizontal" case 
are too noisy to allow unambiguous interpretation. In the "vertical" 
case one observes, for both cases of r = 6min and r = 60min, a 
picture consistent with that of conditional mean generated through the 
presence of positive correlation, see below Eqs. (0) and (fTTj) . 

• The plots of the relative conditional standard deviation cr con d in "hor- 
izontal" and "vertical" case are, for the both cases of r = 6min and 
r = 60 min, strikingly similar. For r = 6 min we observe a pronounced 
conditional volatility smile, or dependence-induced volatility smile (D- 
smile) (see a more detailed discussion of this phenomenon in the next 
section), such that at small x the standard deviation of the response 
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is smaller than the unconditional standard deviation, while in the tails 
it is, on contrary, larger. For r = 60 min the smile is noticeably flatter 
than for r = 6 min. This effect can be explained by the decay of anoma- 
lous kurtosis of price increments with growing r, see below discussion 
after Eq. ijTT) . 

• The median conditional kurtosis is noticeably smaller than the uncon- 
ditional one. 

We see, that in both "vertical" and "horizontal" cases the data shows, for 
both scales of r = 6 min and r = 60 min, the same rather nontrivial patterns: 
conditional volatility smile and decrease of conditional kurtosis. The origin 
of the first effect is discussed in the next section. We shall argue, that it is in 
the probabilistic dependence of the adjacent price increments, whereas the 
role of linear correlation effects is in fact minor. 

3 Model 

Let us now present a model that explains the phenomenona of dependence- 
induced volatility smile and kurtosis reduction in the coarse-grained condi- 
tional distributions described in the previous section. 

At the fundamental level of description the model describing the behav- 
ior of N securities in two adjacent time intervals is fully specified by a 2N 
- dimensional probability distribution. The focus of our study is on the 
properties of the conditional distributions constructed from this basic en- 
veloping 2iV-dimensional distribution. Generically conditional distributions 
are obtained by restricting the values of a subset of variables. Let us col- 
lectively denote these variables by x, where x is a iV x - dimensional vector. 
Generically the vector x can include increments belonging to different time 
intervals. We are thus dealing with a conditional distribution depending on 
iV y = 2N — iV x variables. If we stay within the class of elliptical distribu- 
tions, the multivariate probability distribution is a function of a quadratic 
form K, constructed from the vector z T = (y, x) and the generalized covari- 
ance matrix E, /C = z T -£~ 1 -z. The covariance matrix E includes the 
N x x N x covariance matrix C x describing the correlations within the subset 
of conditioned variables x, the iV y x N y covariance matrix C y describing the 
correlations within the subset of the variables y and the N x x iV y covariance 
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matrix C xy describing the cross-covariances between the two groups: 



C y C xy 

C X y 



(3) 



At this stage we have to give an explicit description of the multivariate 
distribution containing the covariance matrix E. As will be elucidated below, 
a simplest choice of a gaussian multivariate distribution does not allow to 
explain the phenomena of D-smile and kurtosis reduction. There is, therefore, 
a clear need of taking into account the non-gaussian effects. The simplest 
possibility of keeping a fat-tailed nature of the probability distributions of 
individual increments is to construct a multivariate distribution from the 
fat-tailed marginals. Recombination of these marginals into a multivariate 
distribution requires constructing an appropriate copula. This construction 
is not unique, so the choice is guided by simplicity and ability to reproduce 
basic features of market data ^21 in what follows we will show that a 
multivariate t-Student distribution makes a good job in this respect, while 
the Gaussian multivariate distribution fails to reproduce the properties of 
conditional distributions observed in market data. 

Let us consider a 2iV-dimensional t-Student distribution 
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where £ M = (/i— 2) //i is a normalization factor ensuring, in particular, that the 
covariances computed with the distribution (jlj) are equal to the corresponding 
matrix elements of the matrix E. 

Fixing some particular configuration of the "initial" increments x = x 
leads to the conditional distribution (see, e.g., [Ej): 
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The conditional distribution (jSJ) is a multivariate N y - dimensional t-Student 
distribution with the index fi + iV x and the following expected mean and 
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covariance matrix: 



(y)xo 
v 

L -'y\ xo 



^xyC x 1 Xq 



(Cy — C X yC x C xy ) 



/i + iv x 



/Km 



(6) 



where /C Xo = x C x x . Let us note, that if we had used the Gaussian 
multivariate distribution for constructing the conditional distribution anal- 
ogous to ©, we would obtain a Gaussian conditional distribution with the 
following expected mean and covariance matrix: 



(y)x 

y| xo 



^xyC x 1 Xo 

y — L-xyL-x Wy 



(7) 



Comparing Eqs. © and J7J) we see, that the expected mean is in both cases 
the same, whereas the expected variance in the t-Student case is a product of 
the gaussian expression and a (i- and /C x - dependent factor. An additional 
important phenomenon in the case of a t-Student distribution is an increase 
of the tail exponent determining the fat-tailedness of the distribution: (i =>■ 
H + N K that thereby reduces the anomalous kurtosis 4 
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(1—4: 



K 



(i + iV x - 4 



(8) 



To describe the conditional volatility smile phenomenon discussed in the 
previous section, one clearly needs initial conditions' depending covariances. 
From the formula (J7J) we see that in the Gaussian case this effect is ab- 
sent, whereas for t-Student distribution the required dependence is manifest 

Of 



xo 



(see the second expression in (JHJ) containing the factor of 

course, one should still prove that this dependence allows to describe the 
market data, see below. Nevertheless, already at this stage of our analy- 
sis, one can conclude that the phenomenon of conditional volatility smile 
can be explained only by non-gaussian effects - simply because the gaussian 
formalism does not have room for its description. 

The conditional distribution Eq. (|SJ) summarizes the impact the "initial" 
configuration xq has on the "final" one y. 



4 Note that the extent of this " gaussization" depends on the number of conditioned 
variables which in the considered example is equal to N x . 
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The explanation of the conditional volatility smile and kurtosis reduction 
effects described in the previous section requires a simpler 2-dimensional 
version of (j3J) with one-dimensional y and x. Let us thus consider two price 
increments in the two consecutive time intervals for the same stock for the 
"horizontal" case (or the simultaneous increments of two stocks for the "ver- 
tical" case) and introduce the corresponding bivariate distribution 

jf'M = , 1 T -W- ! 0) 



2 



Here £ is a covariance matrix 



o-y cr x a y r 
a x a y r o 2 x 



(10) 



and Ky, = (y,x) ■ S" 1 • (y,x) T The conditional distribution V(y\ x = xo) 
corresponding to the above distribution is again a t-Student distribution 
with the tail exponent fi + 1, conditional mean (y) Xo and conditional Xq - 
dependent variance ct^i 

(v)xa = rx o 

<4. = "^-^^{^-.Tf] (") 



Therefore the conditional distribution is more gaussian (the ratio of its anoma- 
lous kurtosis to the unconditional one is equal to (/x — 4)/(/i — 3) < 1), but its 
standard deviation can be smaller or larger than the unconditional value a y 
depending on the value of the conditioned variable Xq. The parabolic depen- 
dence of the conditional volatility on the initial push xq is just the feature we 
need to explain the D-smiles in Figs. ^ El and 01 The fine structure we have 
observed - namely, the flattening of the D-smile with growing r, can also be 
explained with the help of Eq. pijl . Indeed, the coefficient at Xq is equal to 
l/(/i^) = l/(/i — 2). Now the data shows (see below Fig. EJ) that for larger 
time intervals the unconditional anomalous kurtosis k t is smaller, and the 
tail index \i T = 4 + 6/n T is, correspondingly, larger, leading to the desired 
flattening of the smile. The unconditional anomalous kurtosis k t and the 
corresponding tail index \i T are plotted for the ensemble of iV = 100 stocks 
considered in the paper for several intraday time intervals, in Fig. 
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Let us also note that in the gaussian case one has (y) x = r-x and cr^. = 
&y(l — r 2 ) so, as has been already mentioned, the gaussian probabilistic link 
between the price increments does not leave room for xq - dependent effects 
in the conditional covariance matrix. 

To make the correspondence with the market data quantitative we should, 
however, introduce a coarse-grained version of the conditional distribution 
V(y\ x G A), where the variable x belongs to a certain subinterval A: 



We have computed the normalized mean, relative standard deviation and 
anomalous kurtosis of a set of conditional distributions corresponding to the 
same coarse- graining of the increments x = 5p(t)/a tot as used in the analysis 
of the market data in the previous section, tail index /x = 5 and a set of 
correlation coefficients r = 0.25, 0.5, 0.75. The conditional mean is, of course, 
simply proportional to Xq. The conditional kurtosis drops to the expected 
k = 3, with small deviations. Most interesting is, of course, the behavior of 
the conditional standard deviation shown in Fig. El We see that the model 
reproduces the conditional volatility smile with characteristics very similar 
to those observed in the market data. 

A crucial point in the correct interpretation of the above result is that 
linear correlation (present through the correlation coefficient r) shows itself 
only via setting the absolute scale for the variance, see the second of Eq. (jllj) . 
It is clear, that the conditional volatility smile would be present even in the 
complete absence of correlations (r = 0). Therefore it is really appropriate 
to call the volatility dependence in question a dependence-induced volatility 
smile (D-smile). Considering for instance the "horizontal" case, the proba- 
bilistic dependence between the increments Sp(t) and Sp(t + 1) can be mani- 
festly demonstrated by computing, e.g., the correlator of their absolute values 
G(l) = (| 6p(t)\ ■ | 5p(t + l)\) t -((\ 5p\) t ) 2 ). Calculating this correlator for the 




(12) 
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bivariate t-Student distribution Q and its Gaussian counterpart gives 




Arcsin r 



) 



r 



+ 



(13) 



(14) 



In the gaussian case the (linearly) uncorrelated variables are also indepen- 
dent and, indeed, the correlator ()13|) vanishes as r 2 at r — > 0. In the case of 
t-Student distribution the correlator ()14)) is, on contrary, nonzero at r = 0, 
so increments are in this case probabilistically dependent. Let us stress that 
this dependence is in fact imposed by the form of the unconditional distribu- 
tion we have chosen. One crucial feature is that the t-Student distribution 
ensures, in agreement with observations, that the corresponding marginal 
distributions are fat-tailed. The t-Student copula we have used provides a 
framework in which the dependence effects are present even in the complete 
absence of linear correlations. 

4 Discussion 

There still remains a number of important issues related to the questions 
discussed in the paper that we leave for the future analysis [To] . 

First, one would like to generalize the binary- level description of simulta- 
neous "vertical" interdependence of stock price increments to the fully mul- 
tivariate case of the influence of the n-point "trigger" configuration {Spj(t)} 
on the move of the fc-th stock in the next time interval 5pj(t + r). 

Second, perhaps more difficult issue is studying the properties of the 
conditional distributions for arbitrary separation of corresponding time in- 
tervals. Preliminary analysis of the market data shows the dependence of 
D-smile on this separation ("maturity"). This is to be expected from the 
fact that volatility autocorrelations decay, albeit slowly, with time. This 
forces to generalize the formalism we have used 5 . In any case, a big goal is 

5 For an example of a construction of this sort see • 
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to establish connection with the explicit models of volatility dynamics, see 
e.g. [EJ[T7], including the leverage effects jlj. 

Finally, we would like to analyze in more details application of the nonlin- 
ear patterns we have described to portfolio optimization problems. Expected 
mean, volatility and degree of fat-tailedness are crucial ingredients of portfo- 
lio optimization schemes jU [7j , so specific effects related to them are of clear 
interest in this context. 

5 Conclusion 

Let us summarize the main results of the present paper. 

The focus of our analysis is on the properties of conditional distribu- 
tions characterizing the probabilistic behavior of an ensemble of financial 
instruments. The analysis of market data in the simplest case of a binary 
probabilistic dependence has revealed two major effects: 

• The smile-shaped dependence of conditional volatility on the mag- 
nitude of the input due to non-gaussian nature of the enveloping t- 
Student distribution 

• A noticeable reduction of the conditional anomalous kurtosis as com- 
pared to the unconditional one 

Let us also mention the flattening of the D-smile with growing time interval 
on which the price increments are computed. 

We have constructed an explicit model characterizing the collective prob- 
abilistic pattern of an ensemble of price increments that gives a natural expla- 
nation of the above-listed phenomena. The model is based on a multinomial 
t-Student distribution. This theoretical framework allows to unambiguously 
relate the effects of a dependence-induced volatility smile and kurtosis reduc- 
tion to the non-gaussian nature of the eneveloping distribution. 
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6 Appendix 



Below we give a list of stocks studied in the paper: 

A, AA, ABS, ABT, ADI, ADM, AIG, ALTR, AMGN, AMD, AOC, APA, 
APOL, AV, AVP, AXP, BA, BBBY, BBY, BHI, BIIB, BJS, BK, BLS, BR, 
BSX, CA, CAH, CAT, CC, CCL, CCU, CIT, CL, COP, CTXS, CVS, CZN, 
DG, DE, EDS, EK, EOP, EXC, FCX, FD, FDX, FE, FISV, FITB, FRE, 
GENZ, GIS, HDI, HIG, HMA, HOT, HUM, JBL, JWN, INTU, KG, KMB, 
KMG, LH, LPX, LXK, MAT, MAS, MEL, MHS, MMM, MO, MVT, MX, 
MYG, NI, NKE, NTRS, PBG, PCAR, PFG, PGN, PNC, PX, RHI, ROK, 
SOV, SPG, STI, SUN, T, TE, TMO, TRB, TSG, UNP, UST, WHR, WY 
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Figure 3: Medians of normalized mean, standard deviation and anomalous 
kurtosis of the coarse-grained conditional distribution versus the initial push, 
t = 6min ("horizontal": red line, "vertical": blue line) and r = 60min 
("horizontal": green line, "vertical": black line). 



ANOMALOUS KURTOSIS AND TAIL INDEX 




Figure 4: Unconditional anomalous kurtosis k t (red line) and corresponding 
tail index of the t-Student distribution fi T (blue line) for several intraday 
time intervals r. 



CONDITIONAL STANDARD DEVIATION 




Figure 5: Normalized standard deviation of the model coarse-grained con- 
ditional distribution versus the initial push; red: r = 0.25, blue: r = 0.5, 
green; r = 0.75 



